CDS
Accession Number | TCMCG075C27558 |
gbkey | CDS |
Protein Id | XP_007015421.2 |
Location | complement(join(31762995..31763102,31763189..31763525,31763618..31763829,31765108..31765209,31765565..31765678,31765855..31765964,31766744..31766853,31766963..31767165,31767268..31767432,31767596..31767714,31767970..31768057,31768201..31768306,31768471..31768559,31768766..31768909,31769121..31769213,31769357..31769423,31769507..31769619,31769788..31769865,31769867..31769884,31769972..31770133)) |
Gene | LOC18590067 |
GeneID | 18590067 |
Organism | Theobroma cacao |
Protein
Length | 845aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007015359.2 |
Definition | PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | beta-galactosidase |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs |
GO:0003674
[VIEW IN EMBL-EBI] GO:0003824 [VIEW IN EMBL-EBI] GO:0004553 [VIEW IN EMBL-EBI] GO:0004565 [VIEW IN EMBL-EBI] GO:0005575 [VIEW IN EMBL-EBI] GO:0005618 [VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005737 [VIEW IN EMBL-EBI] GO:0005773 [VIEW IN EMBL-EBI] GO:0015925 [VIEW IN EMBL-EBI] GO:0016787 [VIEW IN EMBL-EBI] GO:0016798 [VIEW IN EMBL-EBI] GO:0030312 [VIEW IN EMBL-EBI] GO:0043226 [VIEW IN EMBL-EBI] GO:0043227 [VIEW IN EMBL-EBI] GO:0043229 [VIEW IN EMBL-EBI] GO:0043231 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044444 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0071944 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGGGGAGCAGAACAAAGACTCTGGTGTTGGTTTTTTGGTTGGTTACTGCAACGACGTCGTTTGCAGCCACCGTCACGTACGATCACCGGGCGATTGTCATCGACGGGAAACGCCGTGTTTTGATCTCTGGCTCCATTCATTATCCACGCAGCACCCCTGACATGTGGCCGGACCTTATACAAAAATCGAAGGACGGAGGCTTAGATGTCATTGAAACTTACGTTTTTTGGAATTTACACGAACCAGTTAGAAACCAGTACAATTTCGAAGGAAGAAACGATTTGGTTAAATTTATAAAGTTAGTTGCAGAAGCTGGTCTCTATGTTCATCTACGCATCGGTCCGTATGCTTGCGCTGAATGGAATTATGGTGGTTTTCCTCTTTGGTTACATTTTATACCTGGAATCCAGCTACGAACTGATAATGAACCATTTAAGGCGGAAATGCAGCGGTTCACGGCTAAGATTGTGGCAATGATGAAGCAAGAGAATTTGTATGCATCACAAGGAGGACCCATTATTTTGTCACAGATTGAAAATGAGTATGGGAATATTGATTCATCATATGGGGCAGCTGCAAAACGCTACATCAAGTGGGCAGCTGGTATGGCTGTTTCCTTGGATACTGGAGTTCCCTGGGTTATGTGCCAGCAATCAGATGCTCCTGATCCCATTATCAACACCTGCAATGGTTTCTATTGCGACCAATTCACCCCAAATTCTAACAAGAAACCAAAAATGTGGACTGAGAATTGGACTGGATGGTTTCTTTCATTTGGTGGTGCTGTTCCCTACAGACCTGTAGAAGACATCGCATTTGCTGTTGCACGGTTTTTCCAAAGAGGTGGAACTTTCCAAAACTATTATATGTATCATGGTGGAACGAACTTTGGCAGGACTAGTGGTGGACCCTTTATTGCTACCAGTTATGATTATGATGCTCCAATCGATGAGTATGGACATGTTAGACAACCCAAGTGGGGTCACCTAAGAGATGTTCATAAGGCTATAAAGCTTTGCGAAGAAGCATTGATTGCCACTGATCCTACAATTTCCTCTTTGGGTCCAAACTTGGAGTCTGCTGTATATAAAACAGGATCAGGACTATGTGCCGCTTTTCTAGCCAATGTGGGCACCCAATCTGATGCGACGGTTAATTTCGACGGCAGTTCATACCATTTGCCTGCATGGTCGGTCAGCATCTTACCAGACTGCAAGAATGTAGTTCTGAATACCGCAAAGATTAACTCTATGACTGTAATTCCAAGCTTCATGCATGAACCTTTGAATATCAATGCTGATTCAACTGAGGCAATTGGGACAAGCTGGAGTTGGGTATATGAACCTGTGGGTATCTCAAAGGCTGATGCATTTAAAAAACTTGGATTGTTAGAGCAAATAAACACTACTGCTGATAAAAGCGACTACTTATGGTATTCATTTAGCACTGATATCGAAGGAGATGAGCCTTTCCTTGAAGACGGATCTCAAACTGTTCTTCATGTGGAATCACTGGGGCATGCCCTTCATGCTTTTATAAACGGGAAACTTGCAGGGAGTGGAACTGGTAATAGTGGCAATGCTAAGGTTAAAGTGGATATTCCTGTCACTGTTGGACCTGGGAAGAACACAATTGATCTCCTGAGTTTGACTGTAGGACTGCAGAACTATGGAGCCTTTTTTGACCTAGTGGGGGCAGGGATCACTGGTCCAGTGAAGCTTAATGGTTTAAAGAACGGTAGCAGCATCGATCTCTCCTCACAGCAGTGGATGTATCAGGTTGGACTTAAAGGGGAAGATTTAGGTCTACCAAGTGGAAGTTCATCACAATGGATCTCAAAATCGACCTTGCCCAAGAATCAACCCCTGATTTGGTACAAGACAAATTTTGATGCCCCAGCTGGAAATGACGCAATTGCTTTAGACTTCACTGGGATGGGGAAGGGTGAGGCATGGGTGAATGGACAGAGCATTGGACGTTATTGGCCAGCCTACGTCTCTCGAAGTGGTGGCTGTACTGACTCTTGTAACTATAGAGGATCTTATAATTCAAACAAATGCCTCAAGAATTGTGGGAAGCCATCTCAGCAGTTGTACCATGTACCCCGTTCATGGTTGCAACCAAGTGGCAACATTCTTGTTTTGTTTGAGGAACTTGGTGGGGATCCGACACAGCTTGCTTTTGCAACCAGACAGATGGGAAGTTTGTGCTCACATGTATCAGAATCTCACCCATTACCTGTAGATATGTGGAGTTCAGATTCAAAAACAGGAAGGACTTCAAGCCCTATCCTATCCCTGGTTTGCCCATCTCCAAATCAGGTTATTTCTTCAATCAAATTTGCAAGTTTTGGAACTCCTCGTGGGACTTGTGGTAGTTTTAGCCATGGCAGGTGTAGCAGTGTTAGGGCACTCTCCATCGTACAGAAGGCTTGCACTGGATCGACAAGATGTAGTATTGGAGTATCAACTAGTACATTTGGTGACCCTTGCAAAGGAGTCATGAAGAGCTTAGCTGTAGAAGTTTCCTGTACATGA |
Protein: MGSRTKTLVLVFWLVTATTSFAATVTYDHRAIVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRNDLVKFIKLVAEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQLRTDNEPFKAEMQRFTAKIVAMMKQENLYASQGGPIILSQIENEYGNIDSSYGAAAKRYIKWAAGMAVSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNKKPKMWTENWTGWFLSFGGAVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYGHVRQPKWGHLRDVHKAIKLCEEALIATDPTISSLGPNLESAVYKTGSGLCAAFLANVGTQSDATVNFDGSSYHLPAWSVSILPDCKNVVLNTAKINSMTVIPSFMHEPLNINADSTEAIGTSWSWVYEPVGISKADAFKKLGLLEQINTTADKSDYLWYSFSTDIEGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGNSGNAKVKVDIPVTVGPGKNTIDLLSLTVGLQNYGAFFDLVGAGITGPVKLNGLKNGSSIDLSSQQWMYQVGLKGEDLGLPSGSSSQWISKSTLPKNQPLIWYKTNFDAPAGNDAIALDFTGMGKGEAWVNGQSIGRYWPAYVSRSGGCTDSCNYRGSYNSNKCLKNCGKPSQQLYHVPRSWLQPSGNILVLFEELGGDPTQLAFATRQMGSLCSHVSESHPLPVDMWSSDSKTGRTSSPILSLVCPSPNQVISSIKFASFGTPRGTCGSFSHGRCSSVRALSIVQKACTGSTRCSIGVSTSTFGDPCKGVMKSLAVEVSCT |